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ABSTRACT 

In this work we present a measurement study of user mobil- 
ity in Second Life. We first discuss different techniques to 
collect user traces and then focus on results obtained using a 
crawler that we built. Tempted by the question whether our 
methodology could provide similar results to those obtained 
in real-world experiments, we study the statistical distribu- 
tion of user contacts and show that from a qualitative point 
of view user mobility in Second Life presents similar traits 
to those of real humans. We further push our analysis to 
line of sight networks that emerge from user interaction and 
show that they are highly clustered. Lastly, we focus on the 
spatial properties of user movements and observe that users 
in Second Life revolve around several point of interests trav- 
eling in general short distances. Besides our findings, the 
traces collected in this work can be very useful for trace- 
driven simulations of communication schemes in delay tol- 
erant networks and their performance evaluation. 

Categories and Subject Descriptors 

C2.1 [Computer-communication Networks]: Net- 
work Architecture and Design; J. 4 [Computer Appli- 
cations] : Social and Behavioral Science 

General Terms 

Measurement, Human Factors 

Keywords 

Contact times, Spatial Distribution, Line of sight net- 
works 

1. INTRODUCTION 

This work is motivated by prior studies on human mo- 
bility performed in real life. For example, [4-6] conduct 
several experiments mainly in confined areas and study 
analytical models of human mobility with the goal of 
assessing the performance of message forwarding in De- 
lay Tolerant Networks (DTNs). Each user taking part 
in such experiments is equipped with a wireless device 
(for example a sensor device, a mobile phone, ...) run- 
ning a custom software that records temporal informa- 



tion about their contacts. Individual measurements are 
collected, combined and parsed to obtain the temporal 
distribution of contact times. 

In this paper we present a novel methodology to cap- 
ture spatio-temporal dynamics of user mobility that 
overcomes most of the limitations of previous attempts: 
it is cheap, it requires no logistic organization, it is 
not bound to a specific wireless technology and can 
potentially scale up to a very large number of partici- 
pants. Our measurement approach exploits the tremen- 
dous raise in popularity of Networked Virtual Envi- 
ronments (NVEs), wherein thousands of users connect 
daily to interact, play, do business and follow university 
courses just to name a few potential applications. Here 
we focus on the SecondLife (SL) "metaverse" [3] which 
has recently gained momentum in the on-line commu- 
nity. 

Our primary goal is to perform a temporal, spatial 
and topological analysis of user interaction in SL. Prior 
works that attempted the difficult task of measuring 
and collecting traces of human mobility and contact op- 
portunities are restricted by logistic constraints (num- 
ber of participants to the experiments, duration of the 
experiments, failures of hardware devices, wireless tech- 
nology used). In general, position information of mobile 
users is not available, thus a spatial analysis is difficult 
to achieve [5]. Some experiments with GPS-enabled 
devices have been done in the past [7, 8] , but these ex- 
periments are limited to outdoor environments. 

In this paper we discuss two monitoring architectures 
that we tested and focus on the most robust technique, 
which is based on a custom software module (termed 
a crawler). Our crawler connects to SL and extracts 
position information of all users concurrently connected 
to a sub-space of the metaverse: all results presented in 
this paper have been obtained with this architecture. 

One striking evidence of our results is that they qual- 
itatively fit to real life data, raising the legitimate ques- 
tion whether measurements taken in a virtual environ- 
ment present similar traits to those taken in a real set- 
ting. Our methodology allows performing large exper- 
iments at a very low cost and generate data that can 
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be used for trace-driven simulations of a large variety 
of applications: the study of epidemics and information 
diffusion in wireless networks are just some prominent 
examples. 

2. MONITORING ARCHITECTURES 

Mining data in a NVE can be approached under dif- 
ferent angles. The first architecture we discuss exploits 
SL and its features to create objects capable of sens- 
ing user activities in the metaverse. However, there are 
several limitations intrinsic to this approach that hin- 
der our ultimate goal, which is to collect a large data 
set of user mobility patterns. These limitations mostly 
come from inner design choices made by the developers 
of SL to protect from external attackers aiming at dis- 
rupting the system operation. The limitations incurred 
by the first approach can be circumvented by building 
a crawler that connects to SL as a normal user. 

The task of monitoring user activity in the whole 
SL metaverse is very complex: in this work we focus 
on measurements made on a selected subspace of SL, 
that is called a land (or island). In the following we 
use the terminology target land to indicate the land we 
wish to monitor. Lands in SL can be private, public or 
conceived as sandboxes and different restrictions apply: 
for example private lands forbid the creation and the 
deployment of objects without prior authorization. 

We now detail the monitoring architectures we inves- 
tigated in our work. 

A sensor network architectural]: Our first approach 
has been inspired by current research in the area of wire- 
less sensor networks: it resembles to what one would do 
in the real world to measure physical data (tempera- 
ture, movements, etc ...) by deploying sensor devices 
in the area to be monitored. We built virtual sensors 
using the standard object creation tool accessible from 
a SL client software. Our sensors collect data and com- 
municate with an external web server that stores the 
location information of users connected to the target 
land. The functionality of a sensor is defined using a 
proprietary scripting language [2]. 

A key limitation imposed by the infrastructure of SL 
is that sensors cannot be arbitrarily deployed on any 
land. While it is impossible to deploy objects on private 
lands without authorization, objects on public lands ex- 
pire after a predicted lifetime, which is land dependent. 
To deal with the restricted object lifetime, our system 
replicates all sensors in the same position at regular 
time intervals. 

When a sensor is deployed on the target land, it de- 
tects users (a maximum of 16 users can be detected 
at the same time) that are within the sensing range (96 
meters) with a tunable periodicity and stores this infor- 

1 This approach has been used also in [9]. 



mation in its local cache (16KB is the maximum storage 
space). Due to its limited memory, a sensor initiates a 
connection with our web server and flushes its mem- 
ory using the HTTP protocol as soon as the maximal 
capacity has been reached. The technical specification 
of a sensor imposes several challenges that hinder the 
task of covering an entire land. Moreover, the num- 
ber of HTTP messages that can be exchanged between 
sensors and the web server is restricted by the SL in- 
frastructure: this limits the quantity of data that can 
be retrieved from our sensors, hence a tradeoff exists 
between the granularity of the sensed data and the du- 
ration of a monitoring experiment. 
Monitoring using an external crawler: An alter- 
native approach is to build a custom SL client soft- 
ware (termed a crawler) using libsecondlif e [1]. The 
crawler is able to monitor the position of every user 
using a specific feature of libsecondlif e that enables 
the creation of simple maps of the target land. Measure- 
ment data is stored in a database that can be queried 
through an interactive web application^. The crawler 
connects to the SL metaverse as a normal user, thus 
it is not confined by limitations imposed by private 
lands: any accessible land can be monitored in its total- 
ity; the maximum number of users that can be tracked 
is bounded only by the SL architecture (as of today, 
roughly concurrent 100 users per land); communication 
between the crawler and the database is not limited by 
SL. 

During our experiments, we noted that introducing 
measurement probes in a NVE can cause unexpected 
effects that perturb the normal behavior of users and 
hence the measured user mobility patterns. Since our 
crawler is nothing but a stripped-down version of the 
legacy SL client and requires a valid login/password to 
connect to the metaverse, it is perceived in the SL space 
as an avatar, and as such may attract the attention of 
other users that try to interact with it: our initial exper- 
iments showed a steady convergence of user movements 
towards our crawler. To mitigate this perturbing effect 
we designed a crawler that mimics the behavior of a 
normal user: our crawler randomly moves over the tar- 
get land and broadcasts chat messages chosen from a 
small set of pre-definied phrases. 

3. MEASUREMENT METHODOLOGY 

Using the physical coordinates of users connected to 
a target land, we create snapshots of line of sight com- 
munication networks: given an arbitrary communica- 
tion range r, a communication link exists two users 
Vi,Vj if their distance is less than r. In the following 
we use a temporal sequence of networks extracted from 
the traces we collected using our crawler and analyze 

2 Access to the application can be requested via mail to the 
authors. 
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contact opportunities between users, their spatial dis- 
tribution and graph-theoretic properties of their com- 
munication network. 

A precondition for being able to gather useful data is 
to select an appropriate target land and measurement 
parameters. Choosing an appropriate target land in the 
SL metaverse is not an easy task because a large num- 
ber of lands host very few users and lands with a large 
population are usually built to distribute virtual money: 
all a user has to do is to sit and wait for a long enough 
time to earn money (for free). In this work, we man- 
ually selected and analyzed the following lands: Apfel 
Land, a german-speaking arena for newbies; Dance Is- 
land, a virtual discotheque; Isle of View, a land in which 
an event (St. Valentines) was organized. These lands 
have been chosen as they are representative of out-door 
(Apfel Land) and in-door (Dance Island) environments; 
the third land represents an example of SL events which 
supposedly attract many users. In this paper we present 
results for 24 hours traces: while the analysis of longer 
traces yields analogous results to those presented here, 
long experiments are sometimes affected by instabilities 
of libsecondlif e under a Linux environment and we 
decided to focus on a set of shorter but stable measure- 
ments. A summary of the traces we analyzed can be 
defined based on the total number of unique users and 
the average number of concurrently logged in users: Isle 
of View had 2656 unique visitors with an average of 65 
concurrent users, Dance Island had 3347 unique users 
and 34 concurrent users in average and Apfel Land had 
1568 users and 13 concurrent users in average. 

We launched the crawler on the selected target lands 
and set the time granularity (intervals at which we take 
a snapshot of the users' position) to r = 10 sec. We se- 
lected a communication range r to simulate users equipped 
a bluetooth and a WiFi (802.11a at 54 Mbps) device, 
respectively r\> = 10 meters and r w = 80 meters. In this 
work we assume an ideal wireless channel: line of sight 
networks extracted from our traces neglect the presence 
of obstacles such as buildings and trees. 

User location in SL is expressed by her coordinates 
{x, y, z} which are relative to the target land whose size 
is by default 256 ■ 256 meters. However there is one 
exception: when a user sits on an object (e.g. a bench) 
her coordinates are {x = 0, y = 0, z = 0}. In the target 
lands we selected users did not sit. 

3.1 Temporal analysis 

The metrics we use to analyze mobility patterns are 
inspired by the work of Chaintreau et. al. [4] and al- 
low the analysis of the statistical distribution of contact 
opportunities between users: 

• Contact time {CT): is defined as the time interval 
in which two users (v% , Vj ) are in direct communi- 
cation range, given r; 



• Inter-contact time (ICT): is defined as the time 
interval which elapses between two contact periods 
of a pair of users. Let 

[t(vi,Vj)s> ^(vi,Vj)e\^ \P{vi,Vjj)si t(vi,Vj)e\i ■ ■ ■ i^(vi ,vj )s ' ^(vi,Vj)e\ 

be the successive time intervals at which a contact 
between user Vi and Vj occurs; then, the inter- 
contact time between the k — th and the th 
contact intervals is: 

1{ ~ jJ -(v i ,v j ) — L (v z ,v ] )s L {vi,Vj)e 

• First contact time (FT): is defined as the wait- 
ing time for a user Vi to contact her first neighbor 
(ever). 

3.2 Spatial analysis 

We present here the metrics we used to perform the 
spatial analysis of our traces: 

• Node degree: is defined as the number of neigh- 
bors of a user when the communication range is 
fixed to r; 

• Network diameter: is computed as the longest 
shortest path of the largest connected component 
of the communication network formed by the users. 
We used the largest component since, for a given 
r, the network might be disconnected; 

• Clustering coefficient: is defined as in [10]: we 
compute it for every user and take the mean value 
to be representative of the whole communication 
network; 

• Travel length: for every user m we compute the 
distance covered from its login to its logout coor- 
dinates in SL; 

• Effective Travel time: for every user m we com- 
pute the total time spent while moving; hence, this 
metric does not include pause times; 

• Travel time: for every user Vi we compute the 
total connection time to the SL land we monitor 
with the crawler; 

• Zone occupation: we divided lands in several 
square sub-cells of size LxL and computed the 
number of users in every sub-cell, when L = 20 
meters. 

4. RESULTS 

We now discuss the results of our measurements for 
the three selected target lands and study the influence 
of the communication range (rt, or r w ). 
Temporal Analysis: Fig. Q] illustrates the distribution 
of the temporal metrics we used in this work for = 10 
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Figure 1: Temporal Analysis: Complementary CDF of contact opportunity metrics for three target lands. 



meters and r w = 80 meters. A glance at the comple- 
mentary CDF (CCDF) of the contact time CT indicates 
that the median contact time is roughly 30, 60 and 100 
seconds respectively for Apfel Land, Isle of View and 
Dance Island when r — rb, and about 70, 200 and 300 
seconds for the same set of islands when r = r w . Be- 
sides the intuitive result which indicates larger transfer 
opportunities for larger r, we observe that the distri- 
bution of CT has two phases: a first power-law phase 
and an exponential cut-off phase that limits the CT to 
a few hundreds seconds. 

Similar observations can be done for the CCDF of 
the inter contact time ICT: for the three target lands 
we analyzed, the distribution follows a first power-law 
phase, followed by an exponential cut-off phase. The 
median ICT is around 400 seconds for the two open- 
space lands and between 700 and 800 seconds for the 
Dance Island. Analyzing the same trace of user move- 
ment yields surprisingly similar results with different 
communication ranges. We believe this result is due to 
the fact that users are concentrated around point of in- 
terests (as discussed below) , but it would be interesting 
to compare such findings with real-world experiments. 

Although the distribution of contact opportunities 
appears to be similar for the two open-space lands, the 
CCDF of the first contact time FT illustrates some dif- 
ferences between these lands: in Apfel Land users have 



to wait for a long time before meeting their first neigh- 
bor. The median FT is around 300 seconds for Apfel 
Land, while it is less than 20 seconds for the other two 
lands when r = rt,. The FT improves a lot when in- 
creasing r: the median is around 30 seconds for Apfel 
Land and less than 5 seconds for the other lands. 

These results are quite surprising: from a qualitative 
point of view, we obtained a statistical distribution of 
contact opportunities that mimics what has been ob- 
tained for experiments in the real world [5,7,8]. Obvi- 
ously, human activity roughly spans the 12 hours inter- 
val, while even the most assiduous user which we were 
able to track spent less than 4 consecutive hours on SL, 
hence a quantitative comparison is not immediate. 
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Figure 3: Spatial distribution of users. 
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Figure 2: Line of sight networks: graph theoretic properties for three selected target lands. 



Line of sight networks: We now delve into a de- 
tailed analysis of the communication networks that emerge 
from user interaction when we assume them to be equipped 
with a wireless communication device covering a range 
r e {rb,r w }. Fig. [2] illustrates the aggregated (over the 
whole measurement period) CCDF of the node degree, 
the aggregated CDF of the network diameter and clus- 
tering coefficient. 

The node degree CCDF illustrates a diverse user be- 
havior in each target land: for Apfel Land we observe 
that 60% of users have no neighbors, for the Dance Is- 
land only 10% of users have no neighbors while in the 
Isle of View, all users have at least one neighbor when 
r = r&. When the communication range is set to r — r w 
all users have at least one neighbor in all lands. The 
maximum degree and the whole distribution varies a 
lot between target lands: the main reason lies in the 
physical distribution of users on a land. In Apfel Land 
users are relatively sparse while in the Dance Island, for 
example, most of the users spend most of the time in 
a tiny portion of the land: this observation is corrobo- 
rateqj by our study on the spatial distribution of users 
as shown in Fig. [3] Although the general trend for all 
target lands we inspected is that a large fraction of the 
land has no users, some lands (e.g. Dance Island) are 
characterized by hot-spots with several tens of users. 

3 There is an intuitive reason for this phenomenon: in a dis- 
cotheque users spend most of their time on the dance floor 
or by the bar, while in an open space users are generally 
located more sparsely. 



The CDF of the network diameter illustrates the im- 
pact of different transmission ranges: it is clear that 
the diameter shrinks for r = r w . We note, however, 
that for Apfel Land there is an apparent contradiction: 
for r = rb the maximum diameter is smaller than for 
r = r w . This phenomenon is due to the fact we com- 
pute the diameter of the largest connected component 
of the temporal graph formed by users: when the radio 
range is small (and users are scattered through the tar- 
get land) we observe the emergence of relatively small 
connected components, whereas for larger ranges the 
connected component is large (eventually it includes all 
users), hence a larger diameter. 

In Fig. [3] we also plot the CDF of the clustering coef- 
ficient for the whole measurement period. Our results 
clearly point to high median values of the clustering 
coefficient which indicate that the networks we observe 
are not random grapht@: these networks are highly clus- 
tered but, due to the small number of concurrent users 
that can log in to a land and the results on the net- 
work diameter, we cannot claim at this time that the 
graphs that emerge from user interaction have small 
world characteristics. 

Trip analysis: using physical coordinates, we were 
able to study the statistical distribution of the distance 
travelled by users on the three target lands we analyze 
in this paper. Fig. [4] illustrates the aggregate CDF of 



4 Which are usually characterized by a very small clustering 
coefficient [10]. 
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the travel length, the travel time and the login time for 
all users. Fig. [4|c) shows the CDF of the login time: 
in our measurement we observed that the longest log-in 
time for a user was around 4 hours while 90% of users 
are logged in for less than 1 hour. 

Fig. Sfa) provides further hints towards a better un- 
derstanding of user mobility in the selected target lands. 
For a confined area such as Dance Island, the vast ma- 
jority of users travel less than 230 meters (90th per- 
centile) . This observation however applies also for open 
spaces: for Apfel Land, the 90th percentile is around 
400 meters while it grows up to 500 meters for Isle of 
View. There is a small fraction of users who travel a 
very long distance: for the Isle of View, around 2% of 
users travel more than 2000 meters. Fig. 0Jb) ^ s use " 
ful to infer the distribution of the times a user takes to 
travel from her initial point (the first time our crawler 
tracked the user) to her final point (the last time the 
user has been seen on the target land). 

5. CONCLUSION AND FUTURE WORK 

In this paper we discussed a novel methodology to 
perform user profiling that exploits the raising popu- 
larity of on-line communities emerging from user inter- 
action in Networked Virtual Environments. We stud- 
ied the mobility patterns of users connected to Second 
Life using a crawler that extracts at regular time in- 
tervals user position on a target land. Tempted by 
the question whether any similarity can be found be- 
tween our results and measurements performed in the 
real world, we first characterized the statistical distribu- 
tion of contact opportunities among users. Our analysis 
indicated that mobility patterns in a virtual environ- 
ment share common traits, from a qualitative point of 
view, with those in the real world. We further pushed 
our analysis to characterize the spatial distribution of 
users and their mobility behavior: users are generally 
concentrated around points of interest and travel small 
distances in the vast majority of cases. Finally we char- 



acterized the graph theoretic properties of line of sight 
networks emerging from user interaction and found re- 
sults indicating they are highly clustered. 

Our measurements are publicly available and consti- 
tute a useful material for trace-driven simulations of 
a large variety of applications: the study of epidemics 
and information diffusion in wireless networks, the per- 
formance analysis of forwarding schemes in DTNs, etc... 

Is mobility of users in SL representative of real human 
mobility? In our future work we will try and address 
this question from a qualitative point of view. In this 
paper we have constructed a tool that helps answering 
this key question, but we believe that further study in 
the specification of new metrics to define human mo- 
bility are required. Another interesting area of future 
research would be to build the network of "relation- 
ships" among SL users. Based on the "relation graph", 
new questions can be addressed such as the frequency 
and the strength of contact between acquaintances. 
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